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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: TSUJI, SHOJI 

SANPEI, KAZUHIRO 

(ii) TITLE OF INVENTION: cDNA FRAGMENT OF CAUSATIVE GENE OF 
SPINOCEREBELLAR ATAXIA TYPE 2 

(iii) NUMBER OF SEQUENCES : 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Birch, Stewart, Kolasch & Birch, LLP 

(B) STREET: P.O. Box 747 

(C) CITY: Falls Church 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22040-0747 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: MURPHY Jr., Gerald M. 

(B) REGISTRATION NUMBER: 28,977 

(C) REFERENCE/DOCKET NUMBER: 0760-0248P 

(ix) TELECOMMUNICATION INFORMATION; 

(A) TELEPHONE: (703) 205-8000 

(B) TELEFAX: (703) 205-8050 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 3987 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TATCCGCACC TCCGCTCCCA CCCGGCGCCT CGGCGCGCCC GCCCTCCG ATG CGC TCA 57 
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Met Arg Ser 
1 

GCG GCC GCA GCT CCT CGG AGT CCC GCG GTG GCC ACC GAG TCT CGC CGC 105 

Ala Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu Ser Arg Arg 
5 10 15 

TTC GCC GCA GCC AGG TGG CCC GGG TGG CGC TCG CTC CAG CGG CCG GCG 153 

Phe Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gin Arg Pro Ala 
20 25 30 35 

CGG CGG AGC GGG CGG GGC GGC GGT GGC GCG GCC CCG GGA CCG TAT CCC 2 01 

Arg Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly Pro Tyr Pro 

40 45 50 

TCC GCC GCC CCT CCC CCG CCC GGC CCC GGC CCC CCT CCC TCC CGG CAG 2 49 

Ser Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro Ser Arg Gin 
55 60 65 

AGC TCG CCT CCC TCC GCC TCA GAC TGT TTT GGT AGC AAC GGC AAC GGC 2 97 

Ser Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn Gly Asn Gly 
70 75 80 

GGC GGC GCG TTT CGG CCC GGC TCC CGG CGG CTC CTT GGT CTC GGC GGG 34 5 

Gly Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly Leu Gly Gly 
85 90 95 

CCT CCC CGC CCC TTC GTC GTC GTC CTT CTC CCC CTC GCC AGC CCG GGC 3 93 

Pro Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala Ser Pro Gly 
100 105 110 115 

GCC CCT CCG GCC GCG CCA ACC CGC GCC TCC CCG CTC GGC GCC CGT GCG 4 41 

Ala Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly Ala Arg Ala 

120 125 130 

TCC CCG CCG CGT TCC GGC GTC TCC TTG GCG CGC CCG GCT CCC GGC TGT 489 

Ser Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala Pro Gly Cys 
135 140 145 

CCC CGC CCG GCG TGC GAG CCG GTG TAT GGG CCC CTC ACC ATG TCG CTG 537 

Pro Arg Pro Ala Cys Glu Pro Val Tyr Gly Pro Leu Thr Met Ser Leu 
150 155 160 

AAG CCC CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAA 5 85 

Lys Pro Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
165 W 170 175 

CAG CAG CAG CAG CAG CAG CAG CAG CAG CCG CCG CCC GCG GCT GCC AAT 633 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Pro Pro Pro Ala Ala Ala Asn 
180 185 190 195 

GTC CGC AAG CCC GGC GGC AGC GGC CTT CTA GCG TCG CCC GCC GCC GCG 681 
Val Arg Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala Ala Ala 

200 205 210 

CCT TCG CCG TCC TCG TCC TCG GTC TCC TCG TCC TCG GCC'ACG GCT CCC 729 

Pro Ser Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr Ala Pro 
215 220 225 

TCC TCG GTG GTC GCG GCG ACC TCC GGC GGC GGG AGG CCC GGC CTG GGC 777 
Ser Ser Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly Leu Gly 
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230 235 240 

AGA GGT CGA AAC AGT AAC MA GGA CTG CCT CAG TCT ACG ATT TCT TTT 8 25 

Arg Gly Arg Asn Ser Asn Lys Gly Leu Pro Gin Ser Thr lie Ser Phe 

245 250 255 

GAT GGA ATC TAT GCA AAT ATG AGG ATG GTT CAT ATA CTT ACA TCA GTT 873 

Asp Gly lie Tyr Ala Asn Met Arg Met Val His lie Leu Thr Ser Val 

260 265 270 275 

GTT GGC TCC AAA TGT GAA GTA CAA GTG AAA AAT GGA GGT ATA TAT GAA 921 

Val Gly Ser Lys Cys Glu Val Gin Val Lys Asn Gly Gly He Tyr Glu 
280 285 290 

GGA GTT TTT AAA ACT TAC AGT CCG AAG TGT GAT TTG GTA CTT GAT GCC 9 69 

Gly Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala 
295 300 305 

GCA CAT GAG AAA AGT ACA GAA TCC AGT TCG GGG CCG AAA CGT GAA GAA 1017 

Ala His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu 

310 315 320 

ATA ATG GAG AGT ATT TTG TTC AAA TGT TCA GAC TTT GTT GTG GTA CAG 10 65 

He Met Glu Ser He Leu Phe Lys Cys Ser Asp Phe Val Val Val Gin 

325 330 335 

TTT AAA GAT ATG GAC TCC AGT TAT GCA AAA AGA GAT GCT TTT ACT GAC 1113 

Phe Lys Asp Met Asp Ser Ser Tyr Ala Lys Arg Asp Ala Phe Thr Asp 

340 345 350 355 

TCT GCT ATC AGT GCT AAA GTG AAT GGC GAA CAC AAA GAG AAG GAC CTG 1161 

Ser Ala He Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu 
360 365 370 

GAG CCC TGG GAT GCA GGT GAA CTC ACA GCC AAT GAG GAA CTT GAG GCT 12 0 9 

Glu Pro Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu Glu Ala 
375 380 385 

TTG GAA AAT GAC GTA TCT AAT GGA TGG GAT CCC AAT GAT ATG TTT CGA 1257 

Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg 

390 395 400 

TAT AAT GAA GAA AAT TAT GGT GTA GTG TCT ACG TAT GAT AGC AGT TTA 1305 

Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu 

405 410 415 

TCT TCG TAT ACA GTG CCC TTA GAA AGA GAT AAC TCA GAA GAA TTT TTA 1353 

Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu 

420 425 430 435 

AAA CGG GAA GCA AGG GCA AAC CAG TTA GCA GAA GAA ATT GAG TCA AGT 14 01 

Lys Arg Glu Ala Arg Ala Asn Gin Leu Ala Glu Glu He Glu Ser Ser 
440 445 450 

GCC CAG TAC AAA GCT CGA GTG GCC CTG GAA AAC GAT GAT AGG AGT GAG 14 4 9 

Ala Gin Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu 
455 460 465 

GAA GAA AAA TAC ACA GCA GTT CAG AGA AAT TCC AGT GAA CGT GAG GGG 14 97 

Glu Glu Lys Tyr Thr Ala Val Gin Arg Asn Ser Ser Glu Arg Glu Gly 

470 475 480 
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CAC AGC ATA AAC ACT AGG GAA AAT AAA TAT ATT CCT CCT GGA CAA AGA 15 4 5 

His Ser lie Asn Thr Arg Glu Asn Lys Tyr lie Pro Pro Gly Gin Arg 

485 490 495 

AAT AGA GAA GTC ATA TCC TGG GGA AGT GGG AGA CAG AAT TCA CCG CGT 15 93 

Asn Arg Glu Val lie Ser Trp Gly Ser Gly Arg Gin Asn Ser Pro Arg 
500 505 510 515 

ATG GGC CAG CCT GGA TCG GGC TCC ATG CCA TCA AGA TCC ACT TCT CAC 1641 

Met Gly Gin Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr Ser His 

520 525 530 

ACT TCA GAT TTC AAC CCG AAT TCT GGT TCA GAC CAA AGA GTA GTT AAT 168 9 

Thr Ser Asp Phe Asn Pro Asn Ser Gly Ser Asp Gin Arg Val Val Asn 

535 540 545 

GGA GGT GTT CCC TGG CCA TCG CCT TGC CCA TCT CCT TCC TCT CGC CCA 17 37 

Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser Arg Pro 

550 555 560 

CCT TCT CGC TAC CAG TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC 17 85 

Pro Ser Arg Tyr Gin Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala 

565 570 575 

ACC CCT ACA CGG CCG CCC TCC AGG CCC CCC TCG CGG CCA TCC AGA CCC 18 33 

Thr Pro Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro 
580 585 590 595 

CCG TCT CAC CCC TCT GCT CAT GGT TCT CCA GCT CCT GTC TCT ACT ATG 18 81 

Pro Ser His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met 

600 605 610 

CCT AAA CGC ATG TCT TCA GAA GGG CCT CCA AGG ATG TCC CCA AAG GCC 192 9 

Pro Lys Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala 

615 620 625 

CAG CGA CAT CCT CGA AAT CAC AGA GTT TCT GCT GGG AGG GGT TCC ATA 1977 

Gin Arg His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser lie 

630 635 640 

TCC AGT GGC CTA GAA TTT GTA TCC CAC AAC CCA CCC AGT GAA GCA GCT 2025 

Ser Ser Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala 

645 650 655 

ACT CCT CCA GTA GCA AGG ACC AGT CCC TCG GGG GGA ACG TGG TCA TCA 2073 

Thr Pro Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr Trp Ser Ser 
660 665 670 675 

GTG GTC AGT GGG GTT CCA AGA TTA TCC CCT AAA ACT CAT AGA CCC AGG 2121 

Val Val Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg 

680 685 690 

TCT CCC AGA CAG AAC AGT ATT GGA AAT ACC CCC AGT GGG CCA GTT CTT 2169 

Ser Pro Arg Gin Asn Ser lie Gly Asn Thr Pro Ser Gly Pro Val Leu 

695 700 705 

GCT TCT CCC CAA GCT GGT ATT ATT CCA ACT GAA GCT GTT GCC ATG CCT 2217 

Ala Ser Pro Gin Ala Gly lie lie Pro Thr Glu Ala Val Ala Met Pro 

710 715 720 
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ATT CCA GCT GCA TCT CCT ACG CCT GCT AGT CCT GCA TCG AAC AGA GCT 22 65 

lie Pro Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala 
725 730 735 

GTT ACC CCT TCT AGT GAG GCT AAA GAT TCC AGG CTT CAA GAT CAG AGG 2 313 

Val Thr Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gin Asp Gin Arg 

740 745 750 755 

CAG AAC TCT CCT GCA GGG AAT AAA GAA AAT ATT AAA CCC AAT GAA ACA 23 61 

Gin Asn Ser Pro Ala Gly Asn Lys Glu Asn lie Lys Pro Asn Glu Thr 

760 765 770 

TCA CCT AGC TTC TCA AAA GCT GAA AAC AAA GGT ATA TCA CCA GTT GTT 24 09 

Ser Pro Ser Phe Ser Lys Ala Glu Asn Lys Gly lie Ser Pro Val Val 

775 780 785 

TCT GAA CAT AGA AAA CAG ATT GAT GAT TTA AAG AAA TTT AAG AAT GAT 2 4 57 

Ser Glu His Arg Lys Gin lie Asp Asp Leu Lys Lys Phe Lys Asn Asp 

790 795 800 

TTT AGG TTA CAG CCA AGT TCT ACT TCT GAA TCT ATG GAT CAA CTA CTA 25 05 

Phe Arg Leu Gin Pro Ser Ser Thr Ser Glu Ser Met Asp Gin Leu Leu 
805 810 815 

AAC AAA AAT AGA GAG GGA GAA AAA TCA AGA GAT TTG ATC AAA GAC AAA 2553 

Asn Lys Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu lie Lys Asp Lys 

820 825 830 835 

ATT GAA CCA AGT GCT AAG GAT TCT TTC ATT GAA AAT AGC AGC AGC AAC 2 601 

lie Glu Pro Ser Ala Lys Asp Ser Phe lie Glu Asn Ser Ser Ser Asn 

840 845 850 

TGT ACC AGT GGC AGC AGC AAG CCG AAT AGC CCC AGC ATT TCC CCT TCA 2 64 9 

Cys Thr Ser Gly Ser Ser Lys Pro Asn Ser Pro Ser lie Ser Pro Ser 

855 860 865 

ATA CTT AGT AAC ACG GAG CAC AAG AGG GGA CCT GAG GTC ACT TCC CAA 2 697 

lie Leu Ser Asn Thr Glu His Lys Arg Gly Pro Glu Val Thr Ser Gin 

870 875 880 

GGG GTT CAG ACT TCC AGC CCA GCA TGT AAA CAA GAG AAA GAC GAT AAG 27 4 5 

Gly Val Gin Thr Ser Ser Pro Ala Cys Lys Gin Glu Lys Asp Asp Lys 
885 890 895 

GAA GAG AAG AAA GAC GCA GCT GAG CAA GTT AGG AAA TCA ACA TTG AAT 27 93 

Glu Glu Lys Lys Asp Ala Ala Glu Gin Val Arg Lys Ser Thr Leu Asn 

900 905 910 915 

CCC AAT GCA AAG GAG TTC AAC CCA CGT TCC TTC TCT CAG CCA AAG CCT 28 41 

Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gin Pro Lys Pro 

920 925 930 

TCT ACT ACC CCA ACT TCA CCT CGG CCT CAA GCA CAA CCT AGC CCA TCT 28 8 9 

Ser Thr Thr Pro Thr Ser Pro Arg Pro Gin Ala Gin Pro Ser Pro Ser 

935 940 945 

ATG GTG GGT CAT CAA CAG CCA ACT CCA GTT TAT ACT CAG CCT GTT TGT 2 937 

Met Val Gly His Gin Gin Pro Thr Pro Val Tyr Thr Gin Pro Val Cys 

950 955 960 

TTT GCA CCA AAT ATG ATG TAT CCA GTC CCA GTG AGC CCA GGC GTG CAA 2 985 
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Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gin 
965 970 975 

CCT TTA TAC CCA ATA CCT ATG ACG CCC ATG CCA GTG AAT CAA GCC AAG 30 33 

Pro Leu Tyr Pro lie Pro Met Thr Pro Met Pro Val Asn Gin Ala Lys 
980 985 990 995 

ACA TAT AGA GCA GTA CCA AAT ATG CCC CAA CAG CGG CAA GAC CAG CAT 30 81 

Thr Tyr Arg Ala Val Pro Asn Met Pro Gin Gin Arg Gin Asp Gin His 
1000 1005 1010 

CAT CAG AGT GCC ATG ATG CAC CCA GCG TCA GCA GCG GGC CCA CCG ATT 312 9 

His Gin Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro Pro lie 
1015 1020 1025 

GCA GCC ACC CCA CCA GCT TAC TCC ACG CAA TAT GTT GCC TAC AGT CCT 317 7 

Ala Ala Thr Pro Pro Ala Tyr Ser Thr Gin Tyr Val Ala Tyr Ser Pro 
1030 1035 1040 

CAG CAG TTC CCA AAT CAG CCC CTT GTT CAG CAT GTG CCA CAT TAT CAG 322 5 

Gin Gin Phe Pro Asn Gin Pro Leu Val Gin His Val Pro His Tyr Gin 
1045 1050 1055 

TCT CAG CAT CCT CAT GTC TAT AGT CCT GTA ATA CAG GGT AAT GCT AGA 3273 
Ser Gin His Pro His Val Tyr Ser Pro Val lie Gin Gly Asn Ala Arg 
1060 1065 1070 1075 

ATG ATG GCA CCA CCA ACA CAC GCC CAG CCT GGT TTA GTA TCT TCT TCA 3321 
Met Met Ala Pro Pro Thr His Ala Gin Pro Gly Leu Val Ser Ser Ser 
1080 1085 1090 

GCA ACT CAG TAC GGG GCT CAT GAG CAG ACG CAT GCG ATG TAT GCA TGT 33 69 

Ala Thr Gin Tyr Gly Ala His Glu Gin Thr His Ala Met Tyr Ala Cys 
1095 1100 1105 

CCC AAA TTA CCA TAC AAC AAG GAG ACA AGC CCT TCT TTC TAC TTT GCC 3417 
Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr Phe Ala 
1110 1115 1120 

ATT TCC ACG GGC TCC CTT GCT CAG CAG TAT GCG CAC CCT AAC GCT ACC 34 65 

lie Ser Thr Gly Ser Leu Ala Gin Gin Tyr Ala His Pro Asn Ala Thr 
1125 1130 1135 

CTG CAC CCA CAT ACT CCA CAC CCT CAG CCT TCA GCT ACC CCC ACT GGA 3513 
Leu His Pro His Thr Pro His Pro Gin Pro Ser Ala Thr Pro Thr Gly 
1140 1145 1150 1155 

CAG CAG CAA AGC CAA CAT GGT GGA AGT CAT CCT GCA CCC AGT CCT GTT 35 61 

Gin Gin Gin Ser Gin His Gly Gly Ser His Pro Ala Pro Ser Pro Val 
1160 1165 1170 

CAG CAC CAT CAG CAC CAG GCC GCC CAG GCT CTC CAT CTG GCC AGT CCA 360 9 

Gin His His Gin His Gin Ala Ala Gin Ala Leu His Leu Ala Ser Pro 
1175 1180 1185 

CAG CAG CAG TCA GCC ATT TAC CAC GCG GGG CTT GCG CCA ACT CCA CCC 3 657 

Gin Gin Gin Ser Ala lie Tyr His Ala Gly Leu Ala Pro Thr Pro Pro 
1190 1195 1200 

TCC ATG ACA CCT GCC TCC AAC ACG CAG TCG CCA CAG AAT AGT TTC CCA 37 05 

Ser Met Thr Pro Ala Ser Asn Thr Gin Ser Pro Gin Asn Ser Phe Pro 
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1205 1210 1215 

GCA GCA CAA CAG ACT GTC TTT ACG ATC CAT CCT TCT CAC GTT CAG CCG 37 53 

Ala Ala Gin Gin Thr Val Phe Thr lie His Pro Ser His Val Gin Pro 
1220 1225 1230 1235 

GCG TAT ACC AAC CCA CCC CAC ATG GCC CAC GTA CCT CAG GCT CAT GTA 3801 
Ala Tyr Thr Asn Pro Pro His Met Ala His Val Pro Gin Ala His Val 
1240 1245 1250 

CAG TCA GGA ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG 38 4 9 

Gin Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met 
1255 1260 1265 

CTA ATG ACG ACA CAG CCA CCC GGC GGT CCC CAG GCC GCC CTC GCT CAA 38 97 

Leu Met Thr Thr Gin Pro Pro Gly Gly Pro Gin Ala Ala Leu Ala Gin 
1270 1275 1280 

AGT GCA CTA CAG CCC ATT CCA GTC TCG ACA ACA GCG CAT TTC CCC TAT 394 5 

Ser Ala Leu Gin Pro lie Pro Val Ser Thr Thr Ala His Phe Pro Tyr 
1285 1290 1295 

ATG ACG CAC CCT TCA GTA CAA GCC CAC CAC CAA CAG CAG TTG 398 7 
Met Thr His Pro Ser Val Gin Ala His His Gin Gin Gin Leu 
1300 1305 1310 

TAAGGCTGCC CTGGAGGAAC CGAAAGGCCA AATTCCCTCC TCCCTTCTAC TGCTTCTACC 4 04 7 

AACTGGAAGC ACAGAAAACT AGAATTTCAT TTATTTTGTT TTTAAAATAT ATATGTTGAT 4107 

TTCTTGTAAC ATCCAATAGG AATGCTAACA GTTCACTTGC AGTGGAAGAT ACTTGGACCG 4167 

AGTAGAGGCA TTTAGGAACT TGGGGGCTAT TCCATAATTC CATATGCTGT TTCAGAGTCC 42 2 7 

CGCAGGTACC CCAGCTCTGC TTGCCGAAAC TGGAAGTTAT TTATTTTTTA ATAACCCTTG 42 87 

AAAG T CAT G A ACACATCAGC TAG C AAAAG A AGTAACAAGA GTGATTCTTG CTGCTATTAC 4 34 7 

TGCTAAAAAA AAAAAAAAAA 4367 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Arg Ser Ala Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu 
15 10 15 

Ser Arg Arg Phe Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gin 
20 25 30 

Arg Pro Ala Arg Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly 
35 40 45 
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Pro Tyr Pro Ser 
50 

Ser Arg Gin Ser 
65 

Gly Asn Gly Gly 



Leu Gly Gly Pro 
100 

Ser Pro Gly Ala 
115 

Ala Arg Ala Ser 
130 

Pro Gly Cys Pro 
145 

Met Ser Leu Lys 



Gin Gin Gin Gin 
180 

Ala Ala Asn Val 
195 

Ala Ala Ala Pro 
210 

Thr Ala Pro Ser 
225 

Gly Leu Gly Arg 



lie Ser Phe Asp 
260 

Thr Ser Val Val 
275 

lie Tyr Glu Gly 
290 

Leu Asp Ala Ala 
305 

Arg Glu Glu lie 



Val Val Gin Phe 
340 

Phe Thr Asp Ser 
355 

Lys Asp Leu Glu 



Ala Ala Pro Pro 

55 

Ser Pro Pro Ser 
70 

Gly Ala Phe Arg 

85 

Pro Arg Pro Phe 



Pro Pro Ala Ala 
120 

Pro Pro Arg Ser 
135 

Arg Pro Ala Cys 
150 

Pro Gin Gin Gin 
165 

Gin Gin Gin Gin 



Arg Lys Pro Gly 
200 

Ser Pro Ser Ser 
215 

Ser Val Val Ala 
230 

Gly Arg Asn Ser 

245 

Gly lie Tyr Ala 



Gly Ser Lys Cys 
280 

Val Phe Lys Thr 
295 

His Glu Lys Ser 
310 

Met Glu Ser lie 
325 

Lys Asp Met Asp 



Ala lie Ser Ala 
360 

Pro Trp Asp Ala 



Pro Pro Gly Pro 
60 

Ala Ser Asp Cys 
75 

Pro Gly Ser Arg 
90 

Val Val Val Leu 
105 

Pro Thr Arg Ala 



Gly Val Ser Leu 
140 

Glu Pro Val Tyr 
155 

Gin Gin Gin Gin 
170 

Gin Gin Gin Gin 
185 

Gly Ser Gly Leu 



Ser Ser Val Ser 
220 

Ala Thr Ser Gly 
235 

Asn Lys Gly Leu 
250 

Asn Met Arg Met 
265 

Glu Val Gin Val 



Tyr Ser Pro Lys 
300 

Thr Glu Ser Ser 
315 

Leu Phe Lys Cys 
330 

Ser Ser Tyr Ala 
345 

Lys Val Asn Gly 



Gly Glu Leu Thr 



Gly Pro Pro Pro 



Phe Gly Ser Asn 
80 

Arg Leu Leu Gly 
95 

Leu Pro Leu Ala 
110 

Ser Pro Leu Gly 
125 

Ala Arg Pro Ala 



Gly Pro Leu Thr 
160 

Gin Gin Gin Gin 

175 

Pro Pro Pro Ala 
190 

Leu Ala Ser Pro 
205 

Ser Ser Ser Ala 



Gly Gly Arg Pro 
240 

Pro Gin Ser Thr 

255 

Val His lie Leu 
270 

Lys Asn Gly Gly 
285 

Cys Asp Leu Val 



Ser Gly Pro Lys 
320 

Ser Asp Phe Val 
335 

Lys Arg Asp Ala 
350 

Glu His Lys Glu 
365 

Ala Asn Glu Glu 
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370 375 380 

Leu Glu Ala Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp 
385 390 395 400 

Met Phe Arg Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp 
405 410 415 

Ser Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu 
420 425 430 

Glu Phe Leu Lys Arg Glu Ala Arg Ala Asn Gin Leu Ala Glu Glu lie 
435 440 445 

Glu Ser Ser Ala Gin Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp 
450 455 460 

Arg Ser Glu Glu Glu Lys Tyr Thr Ala Val Gin Arg Asn Ser Ser Glu 
465 470 475 480 

Arg Glu Gly His Ser lie Asn Thr Arg Glu Asn Lys Tyr lie Pro Pro 
485 490 495 

Gly Gin Arg Asn Arg Glu Val lie Ser Trp Gly Ser Gly Arg Gin Asn 
500 505 510 

Ser Pro Arg Met Gly Gin Pro Gly Ser Gly Ser Met Pro Ser Arg Ser 
515 520 525 

Thr Ser His Thr Ser Asp Phe Asn Pro Asn Ser Gly Ser Asp Gin Arg 
530 535 540 

Val Val Asn Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser 
545 550 555 560 

Ser Arg Pro Pro Ser Arg Tyr Gin Ser Gly Pro Asn Ser Leu Pro Pro 
565 570 575 

Arg Ala Ala Thr Pro Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro 
580 585 590 

Ser Arg Pro Pro Ser His Pro Ser Ala His Gly Ser Pro Ala Pro Val 
595 600 605 

Ser Thr Met Pro Lys Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser 
610 615 620 

Pro Lys Ala Gin Arg His Pro Arg Asn His Arg Val Ser Ala Gly Arg 
625 630 635 640 

Gly Ser lie Ser Ser Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser 
645 650 655 

Glu Ala Ala Thr Pro Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr 
660 665 670 

Trp Ser Ser Val Val Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His 
675 680 685 

Arg Pro Arg Ser Pro Arg Gin Asn Ser lie Gly Asn Thr Pro Ser Gly 
690 695 700 



29 



Pro Val Leu Ala 
705 

Ala Met Pro Tie 



Asn Arg Ala Val 
740 

Asp Gin Arg Gin 
755 

Asn Glu Thr Ser 
770 

Pro Val Val Ser 
785 

Lys Asn Asp Phe 



Gin Leu Leu Asn 

820 

Lys Asp Lys lie 
835 

Ser Ser Asn Cys 
850 

Ser Pro Ser lie 
865 

Thr Ser Gin Gly 



Asp Asp Lys Glu 
900 

Thr Leu Asn Pro 
915 

Pro Lys Pro Ser 
930 

Ser Pro Ser Met 
945 

Pro Val Cys Phe 



Gly Val Gin Pro 
980 

Gin Ala Lys Thr 
995 

Asp Gin His His 
1010 



Ser Pro Gin Ala 
710 

Pro Ala Ala Ser 
725 

Thr Pro Ser Ser 



Asn Ser Pro Ala 
760 

Pro Ser Phe Ser 
775 

Glu His Arg Lys 
790 

Arg Leu Gin Pro 
805 

Lys Asn Arg Glu 



Glu Pro Ser Ala 
840 

Thr Ser Gly Ser 
855 

Leu Ser Asn Thr 
870 

Val Gin Thr Ser 
885 

Glu Lys Lys Asp 



Asn Ala Lys Glu 
920 

Thr Thr Pro Thr 
935 

Val Gly His Gin 
950 

Ala Pro Asn Met 
965 

Leu Tyr Pro lie 



Tyr Arg Ala Val 
100 

Gin Ser Ala Met 
1015 



Gly lie lie Pro 
715 

Pro Thr Pro Ala 
730 

Glu Ala Lys Asp 
745 

Gly Asn Lys Glu 



Lys Ala Glu Asn 
780 

Gin lie Asp Asp 
795 

Ser Ser Thr Ser 
810 

Gly Glu Lys Ser 
825 

Lys Asp Ser Phe 



Ser Lys Pro Asn 
860 

Glu His Lys Arg 
875 

Ser Pro Ala Cys 
890 

Ala Ala Glu Gin 
905 

Phe Asn Pro Arg 



Ser Pro Arg Pro 
940 

Gin Pro Thr Pro 
955 

Met Tyr Pro Val 
970 

Pro Met Thr Pro 
985 

Pro Asn Met Pro 



Met His Pro Ala 
102 



Thr Glu Ala Val 
720 

Ser Pro Ala Ser 
735 

Ser Arg Leu Gin 
750 

Asn lie Lys Pro 
765 

Lys Gly lie Ser 



Leu Lys Lys Phe 
800 

Glu Ser Met Asp 

815 

Arg Asp Leu lie 
830 

lie Glu Asn Ser 
845 

Ser Pro Ser lie 



Gly Pro Glu Val 
880 

Lys Gin Glu Lys 
895 

Val Arg Lys Ser 
910 

Ser Phe Ser Gin 
925 

Gin Ala Gin Pro 



Val Tyr Thr Gin 
960 

Pro Val Ser Pro 
975 

Met Pro Val Asn 
990 

Gin Gin Arg Gin 
1005 

Ser Ala Ala Gly 
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Pro Pro lie Ala Ala Thr Pro Pro Ala Tyr Ser Thr Gin Tyr Val Ala 
1025 1030 1035 104C 

Tyr Ser Pro Gin Gin Phe Pro Asn Gin Pro Leu Val Gin His Val Pro 
1045 1050 1055 

His Tyr Gin Ser Gin His Pro His Val Tyr Ser Pro Val lie Gin Gly 
1060 1065 1070 

Asn Ala Arg Met Met Ala Pro Pro Thr His Ala Gin Pro Gly Leu Val 
1075 1080 1085 

Ser Ser Ser Ala Thr Gin Tyr Gly Ala His Glu Gin Thr His Ala Met 
1090 1095 1100 

Tyr Ala Cys Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe 
1105 1110 1115 . 112C 

Tyr Phe Ala lie Ser Thr Gly Ser Leu Ala Gin Gin Tyr Ala His Pro 
1125 1130 1135 

Asn Ala Thr Leu His Pro His Thr Pro His Pro Gin Pro Ser Ala Thr 
1140 . 1145 1150 

Pro Thr Gly Gin Gin Gin Ser Gin His Gly Gly Ser His Pro Ala Pro 
1155 1160 1165 

Ser Pro Val Gin His His . Gin His Gin Ala Ala Gin Ala Leu His Leu 
1170 1175 1180 

Ala Ser Pro Gin Gin Gin Ser Ala lie Tyr His Ala Gly Leu Ala Pro 
1185 1190 1195 1201 

Thr Pro Pro Ser Met Thr Pro Ala Ser Asn Thr Gin Ser Pro Gin Asn 
1205 1210 1215 

Ser Phe Pro Ala Ala Gin Gin Thr Val Phe Thr lie His Pro Ser His 
1220 1225 1230 

Val Gin Pro Ala Tyr Thr Asn Pro Pro His Met Ala His Val Pro Gin 
1235 1240 1245 

Ala His Val Gin Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala 
1250 1255 1260 

Pro Met Met Leu Met Thr Thr Gin Pro Pro Gly Gly Pro Gin Ala Ala 
1265 1270 1275 128 

Leu Ala Gin Ser Ala Leu Gin Pro lie Pro Val Ser Thr Thr Ala His 
1285 1290 1295 

Phe Pro Tyr Met Thr His Pro Ser Val Gin Ala His His Gin Gin Gin 



1300 



1305 



1310 



Leu 



(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CACCACCAGC AACAGCAACA GCAGCAGCAG CAGCAGCAGC AG C AG C AG C A G C AG C AG C AG 60 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 120 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 180 

CAGCATCACG GAAACTCTGG GCC 2 03 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CACCACCAGC AACAGCAACA 20 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GGCCCAGAGT TTCCGTGATG 2 0 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 



60 



CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 



120 



CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAG 



165 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CCCTCACCAT GTCGCTGAAG C 21 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CGACGCTAGA AGGCCGCTG 19 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



CTTGCGGACA TTGGCAGCC 



19 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 
TATCCGCAGC TCCGCTCCC 19 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGCCGGGCCG AAACGCGCCG 2 0 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



TTCTCTCAGC CAAAGCCTTC TACTACC 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCTCGGTGTC GCGGCGACTT CC 
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